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Abstract — Recently it has been shown that closed-loop ref- 
erence models can have improved transient performance over 
their open-loop counterparts in model reference adaptive con- 
trol. This work highlights the trade-off between reducing the 
L-2 norms of the derivatives of the adaptive parameters and 
peaking phenomenon in the closed-loop reference model. The 
pervasiveness of closed-loop reference models in other adaptive 
systems and extensions to composite adaptive control are also 
reported. 



I. Introduction 

A universal observation in all adaptive control systems is a 
convergent, yet oscillatory behavior in the underlying errors. 
These oscillations increase with adaptation gain, and as such, 
lead to constraints on the speed of adaptation. The main 
obvious challenge in quantification of transients in adaptive 
systems stems from their nonlinear nature. In addition to 
such a challenge, most adaptive systems possess an inherent 
trade-off between the speed of convergence of the tracking 
error and the size of parametric uncertainty. In this paper, 
we provide a solution to this long standing problem, and 
overcome these trade-offs by proposing an adaptive control 
design that can ensure smooth transients, even with increased 
adaptation. 

Following stability of adaptive control systems in the 80s 
and their robustness in the 90s, several attempts have been 
made to quantify transient performance (see for example, 
[1)-[3|). The performance metric of interest in these papers 
stems from either supremum or L-2 norms of key errors 
within the adaptive system. In [l'| supremum and L-2 norms 
are derived for the model following error, the filtered model 
following error and the zero dynamics. In ||2] L-2 norms 
are derived for the the model following error in the context 
of output feedback adaptive systems in the presence of 
disturbances and un-modeled dynamics. The authors of [3| 
analyze the interconnection structure of adaptive systems and 
discuss scenarios under which key signals can behave poorly. 

Similar to [1 1, [2| we analyze the L-2 norm of the model 
following error. In addition, we propose a solution through 
a modified adaptive control design, where a closed-loop is 
introduced in the reference model. Through the use of the 
modified reference model and a projection algorithm in the 
adaptive law, we obtain a non conservative L-2 norm of the 
model following error which does not simply arise from the 
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direct integration of the Lyapunov candidate derivative. We 
also show how parameter adaptation can be realized without 
incurring the penalty of increased oscillations in the adaptive 
parameters. 

The basic premise of any adaptive control system is to 
have the output of a plant follow a prescribed reference 
model through the online adjustment of control parameters. 
Historically, the reference models in MRAC have been 
open-loop in nature (see for example, |0|, 0), with the 
reference trajectory generated by a linear dynamic model, 
and unaffected by the plant output. The notion of feeding 
back the model following error into the reference model was 
first reported in [6| and more recently in IITI- lfPfl . Denoting 
the adaptive systems with an Open-loop Reference Model as 
ORM-adaptive systems and those with closed-loop reference 
models as CRM-adaptive systems, our goal in this paper 
is to show how CRM-adaptive systems can be designed to 
alleviate the oscillatory property observed in ORM-adaptive 
systems, and obtain a satisfactory transient response. 

In addition to references |[T)-||3], transient performance in 
adaptive systems has been addressed in the context of CRM 
adaptive systems in ll7l- lfT4l . The results in Q, [8| focused 
on the tracking error, with emphasis mainly on the initial 
interval where the CRM-adaptive system exhibits fast time- 
scales. In (51 and iflOl . transient performance is quantified 
using a damping ratio and natural frequency type of analysis . 
However, assumptions are made that the initial state error is 
zero and that the closed-loop system state is independent of 
the feedback gain in the reference model, both of which may 
not hold in general. In this paper, with the only assumptions 
of matching conditions, it is shown as to how CRM adaptive 
systems can be designed to have improved transient response. 

In order to quantify transient performance that relates to 
frequency characteristics, we bring to bear a variety of tools 
including L-2 bounds on key system variables, projection 
algorithm in the adaptive law, an optimally chosen CRM, and 
fundamental principles of real and function analysis. These 
are used to derive L-2 bounds on the parameter derivatives 
which in turn provides an analytical basis for the reduction 
of oscillations. It is also shown that in general, a peaking 
phenomenon can occur with CRM-adaptive systems, which 
then is shown to be minimized through an appropriate design 
of the CRM-adaptive system. Extensive simulation results 
are provided, illustrating the conspicuous absence of these 
oscillations in CRM-adaptive systems in contrast to their 
dominant presence in ORM-adaptive systems. The results of 
this paper build on preliminary versions in tTTTI — lTl~3l where 
the bounds obtained were significantly more conservative. 



While all results derived in this paper are applicable to plants 
whose states are accessible for measurement, we refer the 
reader to [14| for extensions to output feedback. 

Another class of adaptive systems that have been ex- 
plored in the past where a noticeable improvement in 
transient performance was obtained is in the context of 
Combined/composite direct and indirect Model Reference 
Adaptive Control (CMRAC) fl3], EGD. While the results of 
these papers established stability of combined schemes, no 
rigorous guarantees of improved transient performance were 
provided, and have remained a conjecture [17]. We focus 
on this class of adaptive systems as well in this paper and 
introduce CRMs into the picture. We close this paper with a 
discussion of CRM and related concepts that appear in other 
adaptive systems as well including nonlinear adaptive control 
lfl~8l and adaptive control in robotics lTT9l . 

This paper is organized as follows. Section II contains the 
basic CRM structure with L-2 norms of the key signals in the 
system. Section III investigates the peaking in the reference 
model. Section IV contains the multidimensional states ac- 
cessible extension. Section V investigates composite control 
structures with CRM. Section VI investigates other forms of 
adaptive control where closed loop structures appear. 

II. CRM-Based Adaptive Control of Scalar 

Plants 

Let us begin with a simple scalar adaptive system, 

x p (t) = a p x p (t) + k p u(t) (1) 

where x p (t) e K is the plant state, u(t) el is the control 
input, a p G R is an unknown scalar and only the sign of 
k.p <G R is known. For the direct adaptive controller we 
choose a closed-loop reference model as 

x m (t) = a m x m (t) + k m r(t) - £(x(t) - x m (t)). (2) 

All of the parameters above are known and scalar, x m (t) 
is the reference model state, r(t) is a bounded reference 
input. We choose a m , £ < so that the reference model and 
the subsequent error dynamics will be stable. The open-loop 
reference model dynamics 



We define the parameter error 9(t) = 9(t) — 9*, where 



x rnV'l — a mX rn (t) + K m 7"(t) 



(3) 



is the corresponding true reference model that we actually 
want x p to converge to. 

The control law is chosen as 



u(t) = 9 T \t)(j>{t) 



(4) 



where we have defined 9 T (t) — [9(t) k(t)] and 
4> T (t) = [x p (t) r(t)] with an update law 

9 = -7Sgn(fc p )e0 (5) 

where 7 > is a free design parameter commonly referred 
to as the adaptive tuning gain and e(t) — x p (t) — x m (t) is 
the model following error. From this point forward we will 
suppress the explicit time dependance of parameters accept 
for emphasis. 



9* £ M 2 satisfies 9* T = 



The error dynam- 



ics can be derived to be of the following form: 



(6) 



e(t) = {a m +l)e + kp9 T (j>. 

Theorem Q] establishes the stability of the above adaptive 
system with the CRM: 

Theorem 1. The adaptive system with the plant in (JTJ, with 
the controller defined by ((4), the update law in (0 with the 
reference model as in (f2]l is globally stable, lirat^ao e(t) = 
0, and 



< 



1 



■i\ 



l e(0 )3 + 1^1^(0)1(0) 



(7) 



Proof: Consider the lyapunov candidate function 
V(e(t),*(t)) = ie a + iMr£. 

Taking the time derivative of V along the system directions 
we have V — (a, n + £)e 2 < 0. Given that V is posi- 
tive definite and V is negative semi-definite we have that 
V{e(t),9(t) < V(e(0), 1(0)) < 00. Thus V is bounded and 
this means in tern that e and 9 are bounded, with 



HeWHL < 2V(0). 



(8) 



Given that r and e are bounded and the fact that a m < 0, 
the reference model is stable. Thus we can conclude x m , and 
therefore x p , is bounded. Given that 9* is a constant we can 
conclude that 9 is bounded from the boundedness of 9. This 
can be compactly stated as e,x p ,9,9 £ Coo, and therefore 
all of the signals in the system are bounded. 

In order to prove asymptotic stability in the error 
we begin by noting that - f* V = V(e(O),0(O)) - 
V(e(t), 9{t)) < V(e(0), 0(0)). This in turn can be simplified 
as \a m + £\ /o e(t) 2 < V(0) V t > 0. Dividing by \a m + £\ 
and taking the limit as t — > 00 we have 



lellL < 



1 m + i\ 



(9) 



which implies (|7j. Given that e £ £ 2 fl C^ and e £ C^, 
Barbalat's Lemma is satisfied and therefore lim^oo e(i) = 

am ■ 

Theorem 1 clearly shows that CRM ensures stability of 
the adaptive system. We also have that lim^oo e(t) = 
and therefore we have that x m (t) — > x^,(t) as t — > 00. The 
question therefore is if any improvement is possible in the 
transient response with the inclusion of £. This is discussed 
in the following section. 

A. Transient Performance of CRM-adaptive systems 

The main impact of the CRM is a modification in the 
realization of the reference trajectory, from the use of a 
linear model to a nonlinear model. This in turn produces 
a more benign target for the adaptive closed-loop system to 
follow, resulting in better transients. It could be argued that 



the reference model meets the closed-loop system half-way, 
and therefore reduces the burden of tracking on the adaptive 
system and shifts it partially to the reference model. In what 
follows, we precisely quantify this effect. 

As Equation (|7]i in Theorem Q] illustrates, the L-2 norm of 
e has two components, one associated with the initial error 
in the reference model, e(0), and the other with the initial 
error in the parameter space, 9(0). The component associated 
with 0(0) is inversely proportional to the product j\£\ and the 
component associated with the initial model following error 
e(0) is inversely proportional to \£\ alone. Therefore, without 
the use of the feedback gain £ it is not possible to uniformly 
decrease the L-2 norm of the model following error. This 
clearly illustrates the advantage of using the CRM over the 
ORM, as in the latter, t = 0, 

While CRM-adaptive systems bring in this obvious ad- 
vantage, they can also introduce an undesirable peaking 
phenomenon. In what follows, we introduce a definition 
and show how through a proper choice of the gain I, this 
phenomenon can be contained, and lead to better bounds on 
the parameter derivatives. As mentioned in the introduction, 
we quantify transient performance in this paper by deriving 
L-2 bounds on the parameter derivative 9. For this purpose, 
we first discuss the L-2 bound on e and supremum bound for 
x m . We then describe a peaking phenomenon that is possible 
with CRM-adaptive systems. 

1) L-oo norm of x m : The solution to the ODE in (f2j) is 



.(*) 



.(0) 



^a m (t-r) 



JL m (t-r) 



r(r)dr 



(10) 



e(r)dT. 



The solution to the ODE in (01 is 
X° m (t) = e^*x o m (0) - 



,a m (t-T) 



For ease of exposition and comparison, x m (0) 
thus 

x m {t) = x° m {t) - I f e Q "^- T )e(T)dT. 



r(r)dT. (11) 
x° m (0) and 



(12) 



Denoting the difference between the open-loop and closed- 
loop reference model as Ax m — x m — x° m and using Cauchy 
Schwartz Ineqality on f Q e a " l ' t_ ' r - ) ||e(r)||(ir and the bound 
for ||e(i)||i 2 from ©, we can conclude that 



\Ax m (t)\\ < 



1 



V(0) 



|2a m | y \a m + £\ 



(13) 



Finally, using the fact that (x + y) 2 < 2x 2 + 2y 2 , x m (t) can 
be bounded as 

HMOllL <2IK(*)IIL + |f ' 2 ' Hi)i ' 



a m \ \a m + £\ 

{ ill 2 ^r(o)l(o) (U) 

\a m \ \a m +£\ 

We quantify the peaking phenomenon through the follow- 
ing definition: 



Definition 1. A system trajectory x(t) is said to exhibit a 
peaking phenomenon if ||aj(t)||.L 00 < ab + c where b > is 
a function of the initial conditions and fixed parameters in 
the system, a > is a free design parameter and c > is a 
function of the initial conditions of the plant and controller 
alone. 

It can be shown that x m (t), the trajectory of CRM, 
exhibits peaking phenomenon, as substantiated below. From 
the inequality in (fl4] i it can be seen that, the first term does 
not depend on any free parameter, and can be equated to c 
in Definition 1. By choosing 7 and £ to be of the same 
order of magnitude, the effect of the third term can be 
contained. However, the second term is proportional to £, and 
therefore can produce the peaking phenomenon as stated in 
Definition 1 as \£\ increases. While it is tempting to simply 
pick e(0) = as is suggested in (9), ifTol to circumvent this 
problem, it is not always possible to do so, as x(0) is not 
measurable because of any noise or disturbance that may be 
present or if the states are not all accessible for measurement. 

2) L-2 norm of k,9: With the bounds on e and x m in 
the previous sections, we now derive bounds on the adaptive 
parameter derivatives. From (|5} we can deduce that ||fc|| 2 = 
7 2 e 2 r 2 . Integrating both sides and taking the supremum of 
r we have 



|fc(T)|| 2 dr < 7 2 ||r(*)|| 2 ||e(t)|| 



Using the bound on ||e||z, 2 from (O we have that 



l*(*)ll 



< 



27 2 lkWlli.y(Q) 

\a m +£\ 



(15) 



(16) 



Similarly, from ((5) we can derive the inequality 



(r)fdr <2 7 2 ||e(i)|| 



e(r) 2 dT 



'27 2 ||Zm(t)||i 



(17) 

e(T) 2 dT. 







Using the bounds for |j e(4) || i,_^, in (HJ, ||e(i)||i 2 in (0 and 
II ^mW II Loo m 02L me b oun d in ( TTTb can be simplified as 



l*(*)l|L <47 : 



V(0)\\x° m (t)\ 



27 2 



U(0) 2 



4 7 2 



v(oy 



(18) 



\a m \ \a m + £\ 

From (fTST l it is clear that by increasing \£\ one can 
arbitrarily decrease the L-2 norm of k. The same is not 
true, however, for the L-2 norm of 9 given in ( TT8l . Let- 
ting £ approach negative infinity, the second term in ([T81 
approaches zero and the third term converges to a bound 
which is proportional to T^(0) 2 . When £ — 0, the second 
term becomes proportional to U(0) 2 and the last term in (fT8l 
becomes zero. Thus, the initial condition of the Lyapunov 
function will always be present in the bound for ||#||l 2 
whether one uses ORMs or CRMs. Focusing on the first- 
term, similar to (TT4V we once again see that its magnitude 
increases with r y 2 /£ and therefore one cannot simultaneously 



reduce the peaking phenomenon in x m (t) and at the same 
time control the speed of adaptation in 9. Therefore it still 
remains to be seen as to how and when CRM leads to an 
advantage over ORM. As shown in the following section, this 
can be demonstrated through the introduction of projection 
in the adaptive law and a suitable choice of £ and 7. 

B. Effect of Projection Algorithm 

It is well known that some sort of modification of the adap- 
tive law is needed to ensure boundedness in the presence of 
perturbations such as disturbances or unmodeled dynamics. 
We use a projection algorithm [20| with CRMs as 



e = Pro Jo(-7Sgn(A: p )e0,i 



(19) 



where 9(0), 9* € fl, with Q e R 2 a closed and convex set 
centered at the origin whose size is dependent on a known 
bound of the parameter uncertainty 9* . Equation ( fT9] l assures 
that 9(t) gn\/t>0 |20l. The following definition wiU be 
used throughout: 



©max = SUp |! 



(20) 



Beginning with the already proven fact that V < (a m + 
£)e 2 , we note that the following bound holds as well with 
the use of ([T9]i: 



V(t)<-2\a m +l\V+ l -^±^\k p \el dx . 
Using Gronwall-Bellman ll2D it can be deduced that 



(21) 



V(t) < V(0) - 



27 



© 



i -2\a m +e\t 



^©Lx (22) 



which can be further simplified as 



V{t) < -||e(0)|| e "^ 1 +— ©max (23) 



which informs the following exponential bound on e(t): 



||e(t)|| 2 <|| e (0)|| 2 e- 2 l a '»+^ + 



^e 



7 



(24) 



The discussions in Section [XT] show that with a projection 
algorithm, the CRM adaptive system is not only stable 
but satisfies the transient bounds in Q, ( TBi i. fl6l . (TTsT >. 
(l23l and <|24). The bounds in (Qj}, ^ and O leave 
much to be desired however, as it is not clear how the 
free design parameters £ and 7 can be chosen so that the 
bounds on ||A;||z, 2 and \\9\\l 2 can be systematically reduced 
while simultaneously controlling the peaking in the reference 
model output x m . Using the bounds in d23l and (l24T i we 
propose an optimal CRM design that does not suffer from 
the peaking phenomena, and show how the bounds in (TBI 
and (TToT i can be further improved. We also make a direct 
connection between the L-2 norm of the derivative of a 
signal, and the frequency and amplitude of oscillation in that 
signal. 



III. Bounded Peaking with CRM adaptive systems 

A. Bounds on x m 

We first show that the peaking that ||x m (t)|| was shown 
to exhibit in Section II-A can be reduced through the use 
of a projection algorithm in the update law as in JT9l , and 
a suitable choice of 7 and £. For this purpose we derive 
two different bounds, one over the time interval [0, ti] and 
another over [t 1; 00). 

Lemma 1. Consider the adaptive system with the plant in 
(HJ, with the controller defined by ©, the update law in (1191 
with the reference model as in (0. For all 8 > 1 and e > 0, 
there exists a time t\ > such that 



M*)||<*IM0)|| + e||r(t)|| 



||Mt)|| < *II*p(0)|| + elkWIUso + ^2^(0) 
V < t < ti. 



(25) 



Proof: The plant in (Q]) is described by the dynamical 
equation 

Xp = [CLjri 1 fcpt/JXp "T" KpKT 

where we note that (a m + k p 9) can be positive. This leads 
to the inequality 



\x P (t)\\ <|M0)||e 



(a m + |fc p |e m »)t 



+ 



3 (a m + |fc p |e m »)( t -r) |fcp| e max || r ( T )|| L=odr . 



For any 5 >1 and any e > 0, it follows from the above 
inequality that a t\ exists such that e( am+ l fep l e,mx ) t < 5 and 
f o * e (a m +|/c p |e m „)(t-r) 1^1 e max ^ T < e> v t < h. The bound 

on x m (t) follows from the fact that \\x m \\ < \\x p \\ + ||e|| and 
from ®. ■ 

Remark 1. The above lemma illustrates the fact that if t\ 
is small, the plant and reference model states cannot move 
arbitrarily far from their respective initial conditions over 

[0, h}. 

Lemma 2. For any a > 3 an x* < such that for all 

x < x* < 

e xa <\x\~^ Vi<i*<0. 

Proof: Exponential functions with negative exponent 

decay faster than any fractional polynomial. ■ 

We now derive bounds on x m (t) to t > t\. For this 

purpose a tighter bound on the error e than that in (O is 

first derived. 

Lemma 3. Consider the adaptive system with the plant in 
(03, with the controller defined by (|4). the update law in ( 1191 ) 
with the reference model as in (|2J, Given a time t\ > 0, there 
exists an £* s.t. 



|e(0)|| 



i;j 



, n ~ V2\a m + £\ \2 1 \a m +£\ 



&max (26) 



for all £ < £*. 



Proof: Substitution of t = t\ in d23l and using the 
fact that V(t) = -\a m + ^|||e(i)|| 2 , the following bound is 
obtained: 

| e(0) ||2 e -2|a m +£| tl 



'dT< 1 - 



2\a r , 



■*\ 



(27) 



27|am+^| 



e 



2 
max" 



Noting that Ve 2 l a m+^|*i = e l«m+^|ti ) an( j usm g the result 
from Lemma [2] we know that there exists an £* such that 

for all £ < £*, e ( a ™+ £ )* 1 < \a m + £\~ 1/2 . This leads to 



Theorem 2. Consider the adaptive system with the plant in 
(Q3, the controller defined by $4^, the update law in ( 1191 l with 
the reference model as in ©, with t\ chosen as in Lemma 
Q] and £ < £* where £* satisfies d261 l. It can then be shown 
that 



\\x m (i)\\ 2 t> tl <ci(t) + 



|e(o)|| 2 , KN^ieL, 



l\a r , 



(28) 



where 

c 1 ^2(\\x m \\L^ + \\x m {ti)\\e-\ a ^ t - t ^" 
Proof: The solution of (|2|i for t > t\ is given by 
||*m(t)||<|Klk 00 + ||x m (*i)||e-I^K t -^ 

e-l a ™K*- T )||e(r)||dr. 



Using the Cauchy Schwartz Inequality and (1261 1 from Lemma 
[3] we have that 

||a: m (*)lli>ti <|K|| ioo + || x m {ti)\\^ am](t - tl) 



\/ 2 l a m| 

l|e(0)|| 



\^V2|a TO + l| V 2 7|a m +^l/ ' 
for all £ < £*. Squaring leads to 



Remark 2. Through the use of a projection algorithm in 
the adaptive law, the exploitation of finite time stability of 
the plant in Lemma 1, and the use of the extra degree 
of freedom in the choice of I, we have obtained a bound 
for ||£ m (£)l| 2 <t in d25l > which is only a function of the 
initial condition of the plant and controller. Similarly for 
||:r m (£)|j 2 >t , we have derived a bound in (f28b which is 
once again a function of the initial condition of the plant 
and controller, but also dependent on a component which is 
proportional to I^I^O 2 ^- The most important point to note 
is that unlike (fl4] i. the bound on x m in (|28| | is no longer 



proportional to 



That is, after a time tx, which is 



arbitrary, much tighter bounds can be derived for x m . This 
implies that even for large \£\, an appropriate choice of the 
adaptive tuning parameter 7 can help reduce the peaking in 
the reference model. This improvement was possible only 
through the introduction of projection and the use of the 
Gronwell-Bellman inequality. 



B. Bounds on parameter derivatives and oscillations 
We now present the main result of this paper. 

Theorem 3. The adaptive system with the plant in ([TJ, the 
controller defined by @, the update law in d!91 l with the 
reference model as in (|2), with t\ chosen as in Lemma [7] 
and £ < £* where £* is given in Lemma\3\and 7 = |£|, the 
following bounds are satisfied for all 7 > 1: 

'||fc|| 2 dr<(||e(0)|| 2 + |fc p |e 2 J||r(i)|| Loo 



||^|| 2 dr<(||e(0)|| 2 + |fc p |e 2 J C2 

+ (|| e (o)|| 2 + |fc p |eL t / '• '■' 



\£\ 1 



(29) 

where 02,03, C4 are independent of 7 and £, and are only a 
function of the initial conditions of the system and the fixed 
design parameters. 

Proof: Using (TT~5T > and (|26T >. together with the fact that 
7 = |^|, we obtain the first inequality in d29l . To prove the 
bound on 9 we start with (fTTI i. and note that 



7 2 / e(r) 2 dr <(||e(0)|| 2 + |fc p |eL x ) 



(30) 



Using the bound in d30b and setting c 2 = ||^m,(i)|| 2 >t from 
we have the first term in the bound on 9 in 
We note from (124-b and Lemma [2] that 



|e(i)|| < 



e(0) 2 



I ftp] t^max 



vt > t\. 



This together with (T30b leads to the second term in the bound 
on 9 in J291 . Therefore, 02,03 and C4 are independent of 7 
and £. ■ 

Remark 3. From the above Theorem it is clear that if 7 and 
\£\ are increased while holding 7 = \£\, the L-2 norms of 
the derivatives of the adaptive parameters can be decreased 
significantly. Two important points should be noted. One is 
that the bounds in d29l i are much tighter than those in (fT8l . 
with terms of the form 7 2 /£ no longer present. Second, these 
bounds are significantly smaller than those corresponding to 
their ORM-counterparts. 



We now connect the bounds in (1291 to the oft-observed 
frequency characteristics. For this purpose, we use a standard 
property of fourier series and continuous functions [E2l . [1231 
summarized in Lemma |4] below: 

Lemma 4. Consider any periodic signal /(t) e M 
over a finite interval T=[t 1 ,t 1 +r] with a period 
t. The Fourier coefficients of f(t) are then given by 
F(ri) = / T ftye-^Wdt If f(t) G C\ given e x > 0, there 
exists an integer Nx such that 



(1) 



f(t)-EnL- Nl F(n)e^^ <ex, V* G T. 

If in addition f(t) € C 2 , then for all ti > there exists an 
integer N2 such that 



(ii) 



&/(*)- ££-*>(n)F(n)e* 



i(n)t 



< £ 2 ,VtGT. 



The above lemma is now used to illustrate the connection 
between the L-2 norm on 9 and the reduced oscillations that 
are observed in the simulation sections. 

We define g{t) = 9(t) - 9 Q t where 

9(t 1 +r)^9(t 1 ) 

fro = ■ 

T 

It can be seen that g(t) is periodic over \b\,t\ + r]. Hence, 
from Lemma 2Ji), it follows that 



ATi 



fl (*)- E G f (n)e iw W* 

n=—Ni 



< ei, Vi G T, 



(31) 



where G(n) are the Fourier coefficients of g(t). Since r is 
continuous, it follows that 9 E C 1 . Hence, it follows that 



A'i 



j(t)- ^ »G(n)w(n)e* 

n=-JVi 



;(n)« 



<e 2 , VteT. (32) 



Defining A(t) as the slack variable corresponding to the 
inequality d32b . it follows that 



9{t) = 9 + J2 ^(n)w(n)e iw(n)t + A(i) Vt G T. (33) 

n=-iVi 

Combining ( f33b with the second equation in d29b . it follows 
that 

2 

<M 

n=-N ± 



rti+r 



JVi 



/ 6>o + JI *G(n)w(7i)e l "(") t + A(t) 

Vt G T where 

M=(||e(0)|| 2 + |fc p |9 




Since A(£) is arbitrarily small and 9q can be made arbitrarily 
small as well for a large r, as M decreases, the quantity 
|£r(n)w(n)| decreases as well. That is, G(n) decreases as 
to(n) increases, implying that the high frequency content 
is minimized as the bound M is minimized. This forms 
the analytical basis for the reduction in high frequency 
oscillations with CRM-adaptive systems. 

C. Simulation Studies for CRM 

Simulation studies are now presented to illustrate the 
improved transient behavior of the adaptive parameters and 
the peaking that can occur in the reference model. For 
these examples the reference system is chosen such that 
a m = — 1, k m = 1 and the plant is chosen as a p = 1, k p = 2. 
The adaptive parameters are initialized to be zero. Figures 1 
through 3 are for an ORM adaptive system with the tuning 
gain chosen as 7 G {1,10,100}. Walking through Figures 
1 through 3 it clear that as the tuning gain is increased 
the plant tracks the reference model more closely, at the 
cost of increased oscillations in the adaptive parameters. 
Then the CRM is introduced and the resulting responses are 



shown in Figures 4 through 6, for 7 = 100, and i?=-10, - 
100, and -1000 respectively. First, it should be noted that 
no high frequency oscillations are present in any of these 
cases, and the trajectories are smooth, which corroborates 
the inequalities d29l > in Theorem [3] As the ratio \£\ /j 
increases, as illustrated in Figures 4 through 6, the reference 
trajectory x m starts to deviate from the plant output x p , with 
the peaking phenomenon clearly visible in Figure 6 where 
\£\ /j = 10. This corroborates our results in section III as 
well. 
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Fig. 1. Trajectories of the ORM adaptive system 7 = 1. 
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Fig. 2. Trajectories of the ORM adaptive system 7 = 10. 
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Fig. 3. Trajectories of the ORM adaptive system 7 = 100. 



IV. CRM for States Accessible Control 

In this section we show that the same bounds shown pre- 
viously easily extend to the states accessible case. Consider 
the n dimensional linear system 



Ax v + BAu 



(34) 
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Fig. 4. Trajectories of the CRM adaptive system 7 = 100, 
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Fig. 5. Trajectories of the CRM adaptive system 7 = 100, 
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Fig. 6. Trajectories of the CRM adaptive system 7 = 100, 



-1000. 



with B known, A. A are unknown and A is diagonal and 
positive. The reference model is defined as 

Xm = A m X m + Br - Le. (35) 

The control input is defined as 

u = Qx p + Kr. (36) 

It is assumed that there exists 6* and K* such that 
A + BAQ* = A m 
AK* = I 

and the parameter errors are then defined as 6 = — 9* 
and K = K — K*. Defining the error as e = x p — x m , the 
update law for the adaptive parameters is then 

e = Proj^(-ri? T Pex^e) 

k = ?m] n2 {-TB T Per T ,K) 

where P — P T > is the solution to the Lyapnov equation 

(Am + L) T P + P(A m + L) = -Q which exists for all 



(37) 



Q = Q T > 0. With a slight abuse of notation the following 
definition is reused from the previous section, 



SUp ||8||f - ©max 

e,e*e£2i 



and 



sup 



\K\\ F ±K« 



(38) 

where \\-\\f denotes the Frobenius norm. The adaptive system 
can be shown to be stable by using the following Lyapunov 
candidate, 



V(t) 



"Pe + Tr^r^e) + Tri^T^K) 



where after differentiating we have that V < —e T Qe. We 
choose L and V in a special form to ease the analysis in the 
following sections. 



Assumption 1. The free design parameters are chosen as 

(39) 



f — T-'raxn 



where 7 > and g < 0. 

Assumption 1 allows us to choose a P = l/2I nxn in 
the Lyapunov equation and therefore Q = —gl nxn - Using 
these simplification the Lyapunov candidate derivative can 
be bounded as 



V(t)<-\g\\\e(t)\\ 2 , 
and by direct integration we have 

no) 



l|e(t)|| 



< 



1.91 



(40) 



(41) 



Using the Gronwall-Bellman Lemma as was previously used 
in d2l1)-(l24l). we can deduce that 



V(t) < \\\e(0)\ 



2 e -2\g\t 



1 



e 2 



Ki 







(42) 



Lemma 5. For all e > and 6 > 1 there exists a t% such that 
the plant and reference model in (134t and ( 135b respectively 
satisfy the bounds in d251 l with £2 replacing t\. 

Lemma 6. Consider the adaptive system with the plant 
in d341 >, the controller in ( 1361 ), the update law in d371 l, the 
reference model as in ( 135b and T and L parameterized as in 
Assumption^ Given a time ti > 0, there exists a g* s.t. 



2 dr< 



|e(0)|| 



ie 2 

^ max 



K 2 

max 



V2\g\ V - '/ 

for all g < g*. 

Proof: From (T55t we have that 

V(t 2 ) < i|| e (0)|| 2 e- 2 ^ + 1 (eL x + K 2 wd> 

Using the above bound and integrating — V in 
to 00 and dividing by \g\ leads to 



(43) 



from £2 



l dr< 



-2|fl|t 2 



1 



2|5l 



7131 



(9 



2 
max 



K l 



(44) 



Taking the square root, noting that ve^W = e I 9 '* 2 , and 
using the result from Lemma [2] we know that there exists 
an g* such that for all g < g* , e^l* 2 < \g\~ 1/2 . ■ 

Theorem 4. Consider the adaptive system with the plant 
in d341 l, the controller in ( 1361 ), the update law in (|37| l, the 
reference model as in (1351 l, T and L parameterized as in 
Assumption [7J with t-i chosen as in Lemma \5\ and g < g* 
where g* is given in Lemma |6] It can be shown that 

\\x m {t)ft>t, < <%(*) + 2 ( ^# + l)^H0)|| 2 



V7lffl 7/a 2 v 



2 ) 

max J 



(45) 



where 



(% 



= 2 



e A m t < 0ie 



•'7i 

(lot 



|a3 m (*2)||aie 



— a 2 (t — t 2 



with ai, a2 > 0. 

Proof: The existence of 01,02 > such that 
e -4 m * < ai e a2t follows from the fact that A m is Hurwtiz 
Consider the dynamical system in (l35T l for t > t%, 



||a:m(t)||t>t a <|km|Uoc 

+ ||L||ai 



e -a2(4-r)|| e ( r )|| dr _ 



Using Cauchy Schwartz along with (l43l in Lemma [6] we 
have that 



\\Xm(t)\\t>t 2 <\\X°J\ L 

\\L\\ 



" /'||e(0)|| 



/02 



if 2 . 



/27^ 



V2 



.91 



Squaring and using the fact that L = —A m 
thus ||L|| < ||.A m || +jwe have that 



l\9\ 
+ gl n > 



and 



\\Xm(t)\\ 2 t>t <C 5 (t) + 



|An||+ff) 2 a 



«2 



-|]e(0)]| 



2(||A m ||+ 5 ) 2 a 



7l3l 
Inequality (05J follows since (||A r , 

2M 2 - 



«2 



i(e 



2 

max 



Kl 



< 211 A, 



Remark 4. Just as in the scalar case, we have derived a 
bound for ||x m (t)|| 2 >f9 which is once again a function of 
the initial condition of the plant and controller, but also 
dependent on a component which is proportional to I5I/7. 



Therefore, by choosing 



Isl 



1 with 7 > we can have 



bounded peaking in the reference model. 

Theorem 5. The adaptive system with the plant in ( 1341 i, 
the controller in (1361 l, the update law in ( 1371 l, the reference 
model as in ( |35l l, V and L parameterized as in Assumption 
[7] with £2 chosen as in Lemma \5\ g < g* where g* is given 



in Lemma [6] and 7 chosen such that 7 = |<jt| the following 
bounds are satisfied for all 7 > 1; 



'2 



\\KfdT <\\B\\ (||e(0)|| 2 + X 2 fl , + e 2 fl ,) ||r(t)||j 

o 

||e|| 2 dr<||B|!(||e(0)|! 2 + e 2 M + if 2 J- 




(46) 

where Cq,c-j,c%, Cq are independent 0/7 and g, and are only 
a function of the initial conditions of the system and the fixed 
design parameters. 

Proof: The proof follows the same steps as used to 
derive the bounds in Theorem [3] ■ 

Remark 5. It should be noted that if 7 and \g\ are increased 
while holding 7 = \g\, the L-2 norms of the derivatives of 
the adaptive parameters can be decreased significantly. 

V. CRM Composite Control with Observer 
Feedback 

In this section, we show that the tools introduced to 
demonstrate smooth transient in CRM-adaptive systems can 
be used to analyze CMRAC systems introduced in lfT31 - 
IT17"1 . As mentioned in the introduction, these systems were 
observed to exhibit smooth transient response, and yet no 
analytical explanations have been provided until now for this 
behavior. Our focus is on first-order plants for the sake of 
simplicity. Similar to Section [IVJ all results derived here can 
be directly extended to higher order plants whose states are 
accessible. 

The CMRAC system that we discuss in this paper differs 
from that in [ 15 1 and includes an observer whose state is fed 
back for control rather than the plant state. As mentioned in 
the introduction, we denote this class of systems as CMRAC- 
CO and is described by the plant in (HJ, the reference model 
in (0, an observer as 

x (t) = l(x - x p ) + (a m - k p 0)x o (t) + k p u(t), (47) 

and the control input is given by 

u = 9x + k*r. (48) 

In the above k p is assumed to be known for ease of 



exposition. The feedback gain £ is chosen so that 
ge=a m +e+\k p 9*\ < 0. 



(49) 



Defining e m — x p — x m and e = x — x p , the error dy- 
namics are now given by 



Cm(') — ( a m + f-) e m + KpUX + kptl e 

e (t) = (o m + t)e - k p 9x . 



(50) 



where 9 = 9-9* and 9 = 9-9* with 9* satisfying 
a p + k p 9* = a m and k p k* = k m . The update laws for the 



adaptive parameters are then defined with the update law 
9 = Proj n (-7Sgn(fc p )e m a; , 9) - rjeg 
9 = Proj n (7Sgn(fc p )e a; , 9) + r\eg (51) 

e e = 9-6 

where 7,7/ > are free design parameters. As before we 
define the bounded set 



6 max = max< sup ||0||, sup 
le,e*en e,e*£Q 



(52) 



We first establish stability and then discuss the improved 
transient response. 

A. Stability 

The stability of the CMRAC-CO adaptive system given 
by O, ©, (l47b-(T5Tb can be verified with the following 
Lyapunov candidate 



V(t) 



1 



2,2 



^d§ 2 

7 



7 



which has the following derivative 

V < 9ee 2 m + g e e 2 - 



V\k p \ 

7 



(53) 



(54) 



Boundedness of all signals in the system follows since 
ge < 0. From the integration of (l54l we have {e m ,e ,eg} £ 
C 00 C\L 2 and thus limt^oole™, e , eg} = {0, 0, 0}. Using the 
Gronwall-Bellman Lemma as was previously used in (ETt - 
(l24l . we can deduce that 

V(t) < \ (e m (0) 2 + e o (0) 2 ) e- 2 '^ + M 9 f^ (55 ) 

It should be noted that the presence of a non-zero £ is crucial 
for stability, as gg cannot be guaranteed to be negative if 

£ = 0. 



B. Transient performance of CMRAC-CO 

Similar to Sections II and III we divide the timeline into 
[0,^3] and [£3,00), where £3 is arbitrarily small. We first 
derive bounds for the system states over the initial [0, £3] 
in Lemma [7] bounds for the tracking, observer, parameter 
estimation errors e m , e and eg over [£3,00) in Lemma [8] 
bounds for x over [£3, 00) in Theorem[6] and finally bounds 
for the parameter derivatives 9 and 9 in Theorem [8] 

Lemma 7. Consider the CMRAC-CO adaptive system with 
the plant in (Q3, with the controller defined by ( I48K the 
update law in ( 1511 ) and with the reference model as in (EJ. 
For all 5 > 1 and e > 0, there exists a time £3 > such that 



\\x P {t)\\ < S\\x p (0)\\ 
\\ Xo (t)\\ < 6\\x p (0)\\ 
V < £ < £3. 



|r(t)lk 



a/2V(0) 



(l + e)y/2V(fi) 



(56) 



Proof: The plant in (OJ with the controller in (l48l l can 
be represented as 

x p = (a p + kp9)x p + k p (r + 9e ) 

where we note that (a p + k p 9) can be positive. This leads to 
the inequality 

\\x p (t)\\ <\\x p (0)\\e^ + ^ & ^ t + f e (. a p+\^\e>^)(t-r), 

Jo 

\k p \ (e max ||r(r)|| + e max ||e (r)||) dr. 

For any S >1 and any e > 0, it follows from the above 
inequality that a £3 exists such that e( a,p+ \ kp \ em "' >t < 8 
and f Q ' e( a *+l fc »l e ««X*- T ) |fc p | e max dr < e, < £ < £3 given 
<5 > and e > 0. From the structure of the Lyapunov 
candidate in d53l and the fact that V < we have that 
ll e o(0IUoo — \/2U(0). The bound on x (£) follows from 
the fact that ||x || < ||x p || + ||e ||. ■ 

Lemma 8. Consider the adaptive system with the plant in 
(fl}, the controller in d48l ), the update law in d51l) , and the 
reference model as in (Ejl. Given a time £2 > 0, f/;ere exists 
a g* s s.t. 



/ e m z d,T < -=- — ■ h W — -^-9™., 



„ % <^M + JJHft.CT 



V2 



.'/» 



7l3e| 



e e 2 dr < 



V7Ve m (0) 2 + gOg 



e„ 



/or a// gg < <7g. 

Theorem 6. Consider the adaptive system with the plant 
in ([TJ, the controller in ( 1481 ), the update law in (1511 ), the 
reference model as in (EJ, with £3 chosen as in Lemma [7| 
flwfl? ge — g"g where gg is given in Lemma \E\ It can be shown 
that 

\l\ 2 
IM*)||?>t, < ci (£) + ^2>l(e m (0) 2 + e (0) 2 ) 

\m 
1 «i 2 11 1 



iw\ 



Ua 4 e 



2 

max 



(58) 



where 



C10 = 2 (a 3 |fep| ||r||t 
ae = a m + fc p ee 



M£ 3 )||e ae(t -* 3) 



and 



3 a e (t-r) dr < 



3 2a 9 (t-r) dr < 



«3 



«4 



w/f/i < aj < 00, i G {3, 4}. 

Proof: Given that limt-^oo eg(t) =0we have from (l49b 
that limt-j.00 a# = a m . Thus limj^oo e ae * = 0. Therefore, 



II) 



03,04 < oo. Consider the dynamical system in J47] i for t > 

h- 

\\x (t)\\t>t 3 <\\x (t 3 )\\e a ^-^ 

+ / e^(^)(|^|||e (r)||+|fc p |||r(r)||)dr. 
Jt 3 

Using Cauchy Schwartz and Lemma [8] as before we have 

IM*)||t>t 3 <a 3 |*p| \\r\\ Loc + || x (i 3 )||e- |a "' l(t - f3) 
Ve m (0) 2 + e o (0) 2 



V2 



96\ 



+ \£\ yfala A J— p-rQmax- 

VTlsei 

Squaring and using the inequality (a+6) 2 < 2a 2 + 2b 2 twice, 
we have our result. ■ 

Corollary 7. For the system presented in Theorem [6] setting 
7 = \ge\ and taking the limit as £ — > — oo the following 
bound holds for x (t) 

lim ||a; (i)|| 2 > f 3 < ci (i) + 2^1 (e m (0) 2 + e o (0) 2 ) 



+ |fc p |4a 4 e 2 fl;c . 



(59) 



Theorem 8. The adaptive system with the plant in (Q3, the 
controller defined by (148b . the update law in ( 15 11 1 w/f/z the 
reference model as in (|2), with £3 chosen as in Lemma [7| 
one/ 50 < g*^ where g* B is given in Lemma [5] and 7 chosen 
such that 7 = \g$\ the following bounds are satisfied for all 
7 > 1: 



< :,u2 dT<aand I \\9\\ 2 dr < a (60) 

*3 



with 



<* = (^ + i^|) ( e ™(°) 2 + e o(0) 2 + |*p| 6 2 „) Cll 

where en « independent 0/7 anc/ 59, ant/ W onty a function 
of the initial conditions of the system and the fixed design 
parameters. 



Remark 6. Note that 



i 2 
lim —=■ = !. 



£^-00 



(Jo 



Thus for large \£\ the truncated L-2 norm of 8 is simply a 
function of the initial conditions of the system and the tuning 
parameter 77. 

C. Robustness of CMRAC-CO to Noise 

As mentioned earlier, the benefits of the CMRAC-CO is 
the use of the observer state x rather than the actual plant 
state x. This implies that the effect of any measurement noise 
on system performance can be reduced. This is explored in 
this section and Section [V-DI 



Suppose that the actual plant dynamics is modified from 
CD as 

x a (t) — a p x a (t) + k p u(t), x p (t) = x a (t) + n(t) (61) 

where n(t) represents measurement noise. For ease of expo- 
sition, we assume that n(t) is bounded and deterministic. 
This leads to a set of modified error equations 



e-m(i) =(o m + £)e m + k p 9(t)x + k p 0*e o + £n(t) 



(62) 



e (t) =(o m +£- k p 9*)e - k p 6{t)x - £n{t) 

Theorem 9. The adaptive system with the plant in (16 U , the 
controller defined by (148l l, the update law in (15 11 1 with the 
reference model as in (O, and £ chosen such that a m + £ + 
2 \k p \ \9*\ < 0, all trajectories are bounded and 



V(t)<^(e m (0) 2 + U0) 2 )e 



7 """ 4| 3 „| 2 



-2|fl„|t 

I (63) 



|n(«)|| 



where 



g n ^a m + £ + 2\k p \\8*\. (64) 

Proof: Taking the time derivative of V in ( l53l results 
in 

^<gn(||e m || 2 + || eo || 2 )-|fc p |^ 

+ ||n(t)||||e m (t)|| + ||n(t)||||e (t)||. 
completing the square in ||e m ||||n|| and ||e ||||n|| 

^<-|g„|/2(|| em |r + || eo || 2 )-|^|^ 2 

-| ff „|/2(|| em ||-l/|.g„|||n(t)||) 2 
-| ff „|/2(||e ||-l/|. 9n |||n(t)||) 2 
+ l/(4| ffn |)||n(t)|| 2 . 

Neglecting the negative terms in lines 2 and 3 from above 
and the term involving eg we have that 

^<-|5«|/2(||e m || 2 + ||e || 2 ) + l/(4|.g„|)||n(t)|| 2 , 

and in terms of V gives us 

1 \kp\ \g n \ (?j2 1 n2\ 1 II AAII2 



V<-\g n \V + 



(e 2 + e 2 ) + ^-A\n(t)\\ 

v ' 4| ff „| 



Using the Gronwall-Bellman Lemma and substitution of 
V(t) leads to the bound in (|63]l. ■ 

D. Simulation Study 

For this study a scalar system in the presence of noise is to 
be controlled with dynamics as presented in (loTl l. where n(t) 
is a deterministic signal used to represent sensor noise. n(t) 
is generated from a Gausian distribution with variance 1 and 
covariance 0.01, deterministically sampled using a fixed seed 
at 100 Hz, and then passed through a saturation function with 
upper and lower bounds of 0. 1 and -0. 1 respectively. For the 
CMRAC-CO systems the reference model is chosen as (f2]i 
with the rest of the controller described by (|471 - (|5"TV 



open-loop 
closed-loop 




Fig. 7. (top) reference model trajectories x m , (middle) state x, and 
(bottom) model following e. 



The CMRAC system used for comparison is identical to 
that in [15|. For CMRAC the reference dynamics are now 
chosen as x° n in (O, the observer is the same as CMRAC-CO 

•Hz). 

Further differences arise with the control law being chosen 

as 



6xp + k*r 



The open-loop error e° 



-x° m updates the direct adaptive 



component, with the regressor becoming x v instead of x for 
both 9 and 9 update laws: 



Fig. 8. (top) Control input u, (middle-top) discrete rate of change 
of control input Au/At, (middle-bottom) adaptive parameter 9(t) and 
(bottom) adaptive parameter 9(t). 

TABLE II 

Simulation Parameters 

Paramater Value 



e = Pi"ojf 2 (-7Sgn(fcp)e^Xp, 9) - rje e 
V = P™}n(l s E n ( k p) e oX P , 9) + Tje e . 



(66) 



The complete CMRAC and CMRAC-CO systems are given 
in Table I with the design parameters given in Table II. 
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-1 
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-10 


7 
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100 
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TABLE I 
Test Case Equations 




Paramater 


CMRAC 


CMRAC-CO 


Reference Dynamics 
Observer Dynamics 
Reference Error 
Observer Error 
Input 
Update Laws 


Equation (3) 
Equation J471 

p° — t _ T o 
Go — Xq Xp 

u = dxp + r 
Equation J66l 


Equation {2) 
Equation J471 

Em — %p Xfn 
6o — — Xq Xp 

u = 9x + r 
Equation 0D 



over to 15 seconds, with x m , x, and e m indicated in Figure 
|2l and u, Au/At, 9 and 9 indicated in Figure[8] The addition 
of sensor noise makes the output x p not differentiable and 
therefore we use the discrete difference function A to obtain 
the discrete time derivative of the control input, where 



Am a u(t i+ i) -u{ti) 



At 



«+i 



t; 



t 



i+1 



u = 0.01. 



The simulations have two distinct regions of interest, with 
Region 1 denoting the first 4 seconds, Region 2 denoting 
the 4 sec to 15 sec range. In Region 1, the adaptive system 
is subjected to non-zero initial conditions in the state and 
the reference input is zero. At t = 4 sec, the beginning of 
Region 2, a filtered step input is introduced. Figures [7] and [8] 
illustrate the response of the CMRAC-CO adaptive system 



In both cases, the resulting performance is compared with 
the classical CMRAC system. The first point that should be 
noted is a satisfactory behavior in the steady-state of the 
CMRAC-CO adaptive controller. We note a significant dif- 
ference between the responses of CMRAC-CO and CMRAC 
systems, which pertains to the use of filtered regressors in 
CMRAC-CO. An examination of Au/At in Figure [8] clearly 
illustrates the advantage of CMRAC-CO. 



E. Comments on CMRAC and CMRAC-CO 

As discussed in the Introduction, combining indirect and 
direct adaptive control has always been observed to pro- 
duce desirable transient response in adaptive control. While 
the above analysis does not directly support the observed 
transient improvements with CMRAC, we provide a few 
speculations below: The free design parameter £ in the 
identifier is typically chosen to have eigenvalues faster than 
the plant that is being controlled. Therefore the identification 
model following error e, converges rapidly and 9(t) will have 
smooth transients. It can be argued that the desirable transient 
properties of the identifier pass on to the direct component 
through the tuning law, and in particular eg. 

VI. CRMs in other Adaptive Systems 

While CRMs can be traced to in the context of 
direct model reference adaptive control, such a closed loop 
structure has always been present in, adaptive observers, 
tuning function designs, and in a similar fashion in adaptive 
control of robots. These are briefly described in the following 
sections. 



doti- 



-Twk- The Ci are free design parameters 



where a lk m 

that arise in the definition of the on as defined in d79l in the 
Appendix. Notice that the — a act in the same way as the £ 
in the simple adaptive system first presented in the reference 
model in (0. They act to close the reference trajectories with 
the plant state. The above system also results in similar L-2 
norms for the z error state. Consider the Lyapunov candidate 

' ' ■••'- ' ' (71.) 



V(z(t),9(t)) = ~z T * 



-0 T T-H 
2 



which results in a negative semidefinite derivative V < 
—Co\\z\\ 2 where c$ = mini<i<„ (c,), Thus we can integrate 
— V to obtain the following bound on the L-2 norm of z 

no) 



i*(*)ii 



< 



CO 



(72) 



It is addressed in lfl8l §4.4.1] that while it may appear 
that increasing cq uniformly decreases the L-2 norm of z, 
choosing a to be a large can result in large z(0). The authors 
then provide a method for initializing the z dynamics so that 
z(0) = 0. We have already discussed why this may not be 
possible in a real system. 



A. Adaptive Backstepping with Tuning Functions 

The control structure presented here is identical to that 
presented in |PT8"1 §4.3]. Consider the unknown system 

±i = x 2 + ipi(xi) T 0* 
x\ = X 3 + ip 2 {xi,x 2 ) T 0* 

i„_i = x„ + ip n -i(xi,...,x n -i) T 9* 
±i = ji(x)u + (p n (x) T 9* 



(67) 



where 9* is an unknown column vector, f3(x) is known and 
invertible, the ifi are known, x is the state vector of the scalar 
Xi and the goal is to have y — x\ follow a desired n times 
differentiable y r . The control law propped in ifTll is of the 
form 

^(«« + ^) (68) 



(3(x) 



with an update law 



9 = TWz 



(69) 



where Y — Y T > is the adaptive tuning parameter, z is the 
transformed state error, and W = T n (z,9) with n and the on, 
1 < i < n defined in ( f79b in the Appendix along with the rest 
of the control design. The closed loop system reduces to 



z = A z (z,6,t)z + W(z,6,i)9 
where 9 = 9-9* and 
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(70) 
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B. Adaptive Control in Robotics 

The control structure presented here is taken directly from 
[19. §9.2]. Consider the dynamics of a rigid manipulator 



H(q)q + C(q,q)+g(q)=T 



(73) 



where q is the joint angle, and r is the torque input. It is 
assumed that the system can be parameterized as 

Y(<1, 4, Qr, Qr)a = H(q)q r + C(q, q)q r + g(q) (74) 

where q r is a twice differentiable reference signal, Y is 
known and a is an unknown vector. The control law is chosen 

as 



t = Yd — kds and a = —YY s. 



(75) 



Then, defining the desired dynamics trajectory as qd, the 
reference dynamics of the system are created by 



q r =qd~ M 
where q = q — qd and 

s = q — q r = q + Xq. 



(76) 



(77) 



The stability of the above system can be verified with the 
following Lyapunov candidate, 

V = ~(s T Hs + a T Y- 1 a) . 

Differentiating and using the property that H = C + C T we 
have that 



V = -s T k d s. 



(78) 



We note that A has a similar role in this control structure 
as the £ in the CRM. The desired trajectory is qd (like x° n 
in our examples), however the adaptive parameter is updated 
by the composite variable s instead of directly adjusted by 
the true reference error. 



We now conjecture as to why closed-loop reference mod- 
els have not been studied in direct adaptive control until 
recently. In the two cases of nonlinear adaptive control, 
closed reference trajectory errors are used to update the 
adaptive controller. This is performed in the tuning func- 
tion approach through the selection of the a and in the 
adaptive robot control example with A and the creation of 
the composite variable s. In both cases the stability of the 
system necessitates the introduction of these variables. In 
contrast, in model reference adaptive control, stability is 
derived from the inherent stability of the reference model and 
hence any addition of new variables becomes superfluous. 
When no reference model is present, closing the loop on the 
reference trajectory becomes necessary. With the recent focus 
on improving transients in adaptive systems, CRM now has 
a role in MRAC. And as pointed out in this paper, improved 
transients can result with CRM without introducing peaking 
by choosing the ratio \£\ /j carefully. 

VII. Conclusions 

An increasingly oscillatory response with increasing adap- 
tation gain is a transient characteristic that is ubiquitous in 
all adaptive systems. Recently, a class of adaptive systems 
has been investigated with closed-loop reference models 
where such oscillatory response can be curtailed. In this 
paper, a detailed analysis of such adaptive systems is carried 
out. It is shown through the derivation of analytical bounds 
on both states of the adaptive system and on parameter 
derivatives that a phenomenon of peaking can occur with 
CRMs and that this phenomenon can be curtailed through a 
combination of design and analysis. The resulting adaptive 
systems have improved transient characteristics with reduced 
or no oscillations even as the adaptation gains are increased. 
CRMs are shown to be implicitly present in other problems 
including adaptive nonlinear control and a class of problems 
in robotics. 
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Appendix A 
Tuning Function Parameter Definitions 



z % =Xi - y£* 1} - cxi-i 



Cti= — Zi-l - CiZi - w. 
i-1 






(79) 



A:=2 



n =tj_i + WiZi 

i-1 
W 



~SZ <Pk 



fe=l 



dx k 



